Datadriven Analys och Uppföljning av KTHs Forskning
2025-12-05
This years version of ABM was released about a week ago
Recently released beta version of topics based KTH Research Information
POC for the KTH Indicators dashboard based on consolidated indicators collected from across KTH.
Tests and prep for GDP 2.0 (Gemensamma dataprojektet) - new standard for Swedish project data
Work to use OpenAlex to update DiVA, and to construct bibliometric database
Changes in ABM 2024
The DAUF project now harvests DiVA publication data using the OAI-PMH protocol which regularly updates a single file duckdb database, openly available from object storage:
https://data.bibliometrics.lib.kth.se/kthcorpus/oai.db
The database with the harvested information is currently about 4.4 GB large.It is reqularly updated and contains MODS and JSON representations of “all-kth” DiVA records.
+--------------------------------+ | | | Data Sources | | | +--------------------------------+ | Clean / Crosscheck / Transform v +--------------------------------+ | | | Curated Data | | | +--------------------------------+ | Write / POST v +--------------------------------+ | | | Object Storage (minio) | | | +--------------------------------+ | Read / GET v +--------------------------------+ | | | Data Consumer / Client | | | +--------------------------------+
GDP (Gemensamma data för projekt) is an effort of a number of Swedish research funders to create a common data model for project data. The five funding agencies Energimyndigheten, Formas, Forte, Vetenskapsrådet and Vinnova is developing a standard which enables sharing of open data about fundings and related information.
The standard is developed in cooperation with a reference group including universities and other organisations within the university sector, KTH is a participant in the reference group.
We have participated in reference group coordinated by SUNET for the GDP efforts in collaboration with partners, including Vinnova and other universities
An R package has been developed with a client - https://github.com/KTH-Library/gdp
Regular harvesting of data from the API is now available from object storage “minio” at KTH: https://data.bibliometrics.lib.kth.se/projects/gdp/gdp.db
The DAUF project now harvests DiVA publication data using the OAI-PMH protocol which regularly updates a single file duckdb database, openly available from object storage:
https://data.bibliometrics.lib.kth.se/kthcorpus/oai.db
The database with the harvested information is currently about 4.4 GB large.It is reqularly updated and contains MODS and JSON representations of “all-kth” DiVA records.
Related activities
KTH Cris/Rims
KTH Insights/datastyrning (MS Fabric/Power BI)
Please provide your input in chat or verbally.
If you prefer to give your feedback later or come up with questions after this demo, you are always welcome to email us at biblioteket@kth.se.
DAUF - Demo 11 - 2025-12-05